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Summary 

A mechanism of innate antiviral immunity operating against viruses infecting mammalian cells 
has been described during the last decade. Host cytidine deaminases (e.g., APOBEC3 proteins) 
edit viral genomes giving raise to hypermutated nonfunctional viruses; consequently, viral 
fitness is reduced through lethal mutagenesis. By contrast, sub-lethal hypermutagenesis may 
contribute to virus evolvability by increasing population diversity. To prevent genome editing, 
some viruses have evolved proteins that mediate APOBEC3 degradation. The model plant 
Arabidopsis thaliana encodes for nine cytidine deaminases (AtCDAs), raising the question of 
whether deamination is an antiviral mechanism in plants as well. Here we tested the effects of 
AtCDAs expression on the pararetrovirus Cauliflower mosaic virus (CaMV). We show that A. 
thaliana AtCDAl gene product exerts a mutagenic activity, which indeed generates a negative 
correlation between the level of AtCDAl expression and CaMV accumulation in the plant, 
suggesting that deamination may also work as an antiviral mechanism in plants. 
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Human APOBEC family includes enzymes that catalyze the hydrolytic deamination of cytidine 
to uridine or deoxycytidine to deoxyuridine. This family is composed by eleven known 
members: APOBEC 1, APOBEC2, APOBEC3 (further classified as A3A to A3H), APOBEC4, 
and AID (activation induced deaminase). APOBEC proteins are associated to several functions 
involving editing of DNA or RNA (reviewed in ref. 1). APOBEC 1 (apolipoprotein B editing 
catalytic subunit 1) mediates deamination of cytidine at position 6666 of apolipoprotein B 
mRNA, resulting in the introduction of premature stop codon and the production of the short 
form of the protein 2 " 4 . APOBEC2 is essential for muscle tissue development 5 . APOBEC4 has 
no ascribed function so far 5 . AID deaminates genomic ssDNA of B cells, initiating 
immunoglobulin somatic hypermutation and class switch processes 7 " 9 . Most notably, 
APOBEC3 enzymes participate in innate immunity against retroviruses and endogenous 
retroelements 10 " 12 . Sheehy et al. demonstrated that A3G also plays a role in immunity against 
Human immunodeficiency virus type 1 (HIV-1) 13 . For its antiviral role A3G is packaged along 
with viral RNA 14 . Upon infection of target cells and during the reverse transcription process, 
A3G deaminates the cytosine residues of the nascent first retroviral DNA strand into uraciles. 
The resulting uracil residues serve as templates for the incorporation of adenine, which at the 
end result in strand-specific C/G to T/A transitions and loss of infectivity through lethal 
mutagenesis 15 " 19 . At the other hand, a sub-lethal mutagenic activity of APOBEC3 proteins may 
contribute as an additional source for HIV-1 genetic diversity, hence bolstering its 
evolvability 20 " 22 . APOBEC3 proteins had been shown to inhibit other retroviruses {Simian 
immunodeficiency virus 23 , Equine infectious anemia virus 24 , Foamy virus 25 , Human T-cell 

26 27 28 

leukemia virus , and Murine leukemia virus ), pararetroviruses (Hepatitis B virus ) and DNA 
viruses (Herpes simplex virus 1 (HSV-1) 29 ' 30 and Epstein-Barr virus (EBV) 30 ). In the cases of 
HSV-1 and EBV antiviral role of deaminase has not been demonstrated 30 . Evidences also exist 
that A3G significantly interferes with negative-strand RNA viruses lacking a DNA replicative 
phase 31 . For example, the transcription and protein accumulation of Measles virus, Mumps 
virus and Respiratory syncytial virus was reduced 50 to 70% whereas the frequency of C/G to 
U/A mutations was ~4-fold increased 31 . In contrast, A3G plays no antiviral activity against 
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Influenza A virus despite being highly induced in infected cells as part of a general IFN(3 
response to infection 32 ' 33 . 

Human APOBEC belongs to a superfamily of polynucleotide cytidine and deoxycytidine 
deaminases distributed throughout the biological world 34 . All family members contain a zinc 
finger domain (CDD), identifiable by the signature (H/C)-x-E-x25-30P-C-x-x-C. Plants are not 
an exception and, for example, Arabidopsis thaliana encodes for nine putative cytidine 
deaminases (named as AtCDAl to AtCDA9 genes). Whilst the AtCDAl gene is located in 
chromosome II, the other eight genes are located in chromosome IV. In the case of rice and 
other monocots, only a CDA has been identified 34 . Interestingly, this CDA expression was 
highly induced as part of the general stress response of rice against the infection of the fungal 
pathogen Magnaporthe grisea, resulting in an excess of A to G and U to C mutations in 
defense-related genes 35 . Edited dsRNAs might be retained in the nucleus and degraded, 
generating miRNAs and siRNAs 36 . Provided the presence of the catalytic deamination motif in 
AtCDA proteins and the relevance of deamination as an antiviral innate response in animals, we 
sought first to determine whether any of the AtCDAs can participate in deaminating the genome 
of the pararetrovirus Cauliflower mosaic virus (CaMV; Caulimovirus, Caulimoviridae) and, 
second, to explore whether this deamination may negatively impact viral infection. We 
hypothesize that deamination may take place mainly at the reverse transcription step. CaMV 
genome is constituted by a single molecule of circular double-stranded DNA of 8 kpb 37 . The 
DNA of CaMV has three discontinuities, Al in the minus-sense strand (or a strand) and A2 and 
A3 in the plus-sense strand (yielding the b and g strands). In short, the replication cycle of 
CaMV is as follows 37 : in the nucleus of the infected cell, the a strand is transcribed into a 35S 
RNA, with terminal repeats, that migrates to the cytoplasm. Priming of the 35 S RNA occurs by 
the annealing of the 3' end of tRNA met to the primer-binding site (PBS) sequence, leading to the 
synthesis of the DNA a strand by the virus' reverse transcriptase. Then, the RNA in the 
heteroduplex is degraded by the virus' RNaseH activity, leaving purine-rich regions that act as 
primers for the synthesis of the plus-sense DNA b and g strands. 
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Our results show that AtCDAl significantly increases the number of G to A mutations in 
vivo, and that there is a negative correlation between the amount of AtCDAl mRNA present in 
the cell and the load reached by CaMV, suggesting that deamination of viral genomes may also 
constitute a significant antiviral mechanism in plants. 

Results 

Effect of AtCDAs overexpression on CaMV mutational spectrum. To test the mutagenic 
activity of A. thaliana CDAs, nine Nicotiana bigelovii plants were inoculated with CaMV. 
After systemic infection was established, we performed transitory AtCDA overexpression 
experiments. To do so, the same leaf was agroinfiltrated twice; half leaf was infiltrated with 
one of the nine AtCDA genes and the other half leaf was infiltrated with the empty vector. This 
test was done for all nine AtCDA genes in different plants. The presence of AtCDA mRNAs 
was verified by RT-PCR from DNase-treated RNA extracts. DNA was extracted from 
agroinfiltrated areas for 3D-PCR amplification of a 229 bp fragment in the ORF VII of CaMV. 
3D-PCR uses a gradient of low denaturation temperatures during PCR to identify the lowest 
one, which potentially allows differential amplification of A/T rich hypermutated genomes 38 . 
There were no differences in the lowest denaturation temperature resulting in amplification 
between the controls and the AtCDA-agroinfiltrated samples, suggesting that hypermutated 
genomes should be at low frequency, if present at all, in the latter case. 

PCR products obtained at the lowest denaturation temperature were cloned and 
sequenced. In a preliminary experiment we sequenced 25 clones from each AtCDA/negative 
control pair (Supplementary Table SI). At least one G to A transition was detected in clones 
from areas infiltrated with AtCDAl, AtCDA2 and AtCDA9 genes. For these three genes, we 
further increased the number of sequenced clones up to 106. The CaMV mutant spectra was 
significantly different between plants overexpressing AtCDAl and their respective negative 
controls (Fig. la: ^ = 25.760, 7 d.f, P = 0.001), being the difference entirely driven by the 
471.43% increase in G to A transitions observed in the plants overexpressing AtCDAl. A 
thorough inspection of alignments showed that most of the G to A mutations (65.6%) detected 
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in the different samples were located at the nucleotide position 181 (Supplementary Table SI). 
By contrast, no overall difference existed between the mutant spectra of CaMV populations 
replicating in plants overexpressing At CD A 2 (Fig. lb: = 8.944, 6 d.f, P = 0.177) or AtCDA9 
genes (Fig. lc: % 2 = 6.539, 8 d.f, P = 0.587) and their respective controls. Consistently, the 
mutant spectra from the three AtCDA-overexpressed samples were significantly heterogeneous 
(% 2 = 41.063, 16 d.f, P = 0.001), again due to the enrichment in G to A transitions observed in 
the case of AtCDAl. By contrast, the three independent control inoculation experiments 
showed homogeneous mutant spectra for CaMV (j( = 14.605, 18 d.f, P = 0.689) and 
undistinguishable from the mutant spectra previously reported for natural isolates of this virus 39 . 
This later comparison confirms the stability of CaMV mutant spectrum when not perturbed by 
e.g. host mutagenic enzymes and the host species is always the same. 

Therefore, we conclude that overexpressing the AtCDAl gene resulted in a significant 
shift in CaMV genome composition towards G to A mutations, as expected from a cytidine 
deaminase hypermutagenic activity. 

Effect of suppressing AtCDA expression on the viral load and mutational spectrum of 
CaMV. To test the effects of suppressing the expression of AtCDA on viral accumulation we 
produced a transgenic line of A. thaliana Col-0, named miRl-6-3. This line was stably 
transformed with an artificial micro-RNA (amiR), controlled by the Aspergillus nidulans 
ethanol regulon to achieve ethanol-triggered RNAi-mediated simultaneous suppression of 
AtCDAs 1, 2, 3, 4, 7, and 8 expression. Transgenic and wild-type plants were subjected to 
periodical treatment with 2% ethanol (or water for the control groups). Subsequently, plants 
were inoculated with the infectious clone pCaMVW260 that expresses the genome of CaMV. 
Samples were taken eight days after inoculation and AtCDAl mRNA and CaMV viral load were 
quantified by real time RT-qPCR and qPCR, respectively, in the same samples. For each 
genotype and/or treatment, 22 plants were analyzed. 
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The expression of AtCDAl mRNA depended on the plant genotype (Fig. 2a; GLM: ^ = 
28.085, 1 d.f., P < 0.001) as well as on the interaction of plant genotype and treatment (% 2 = 
26.037, 1 d.f., P < 0.001), suggesting a differential accumulation of AtCDAl mRNA on each 
plant genotype depending on the amiR induction state. Ethanol treatment reduced the amount 
of AtCDAl mRNA by 24.01% in transgenic plants, proving that triggering the expression of the 
amiR significantly and efficiently silences the expression of AtCDAl . Unexpectedly, the effect 
was the opposite in wild-type plants, for which we observed 23.76% increase in AtCDAl 
mRNA accumulation (Fig. 2a) upon treatment with ethanol. This increase in the expression of 
AtCDAl in wild-type plants after the ethanol treatment is certainly puzzling and the underlying 
mechanisms deserve investigation by its own. However, for the purpose of this study, its 
relevance is that it may increase the number of G to A mutations in CaMV genome, thus 
making the antiviral effect stronger to some extent. 

More interestingly, the relative accumulation of CaMV in ethanol-treated plants was 
significantly different depending on the plant genotype being infected (Fig. 2b; Mann- Whitney 
U test, P = 0.002): silencing the AtCDAl gene bolstered CaMV accumulation in 103.10% 
compared to the accumulation observed in wild-type plants. Furthermore, a significant negative 
correlation between the number of molecules of AtCDAl mRNA and viral load was detected 
(partial correlation coefficient controlling for treatment: r = -0.299, 86 d.f, P = 0.005). 

Provided the significant increase of viral load in plants with lower levels of AtCDAl 
mRNA, we sought for the molecular signature of deamination in transgenic plants. For this, we 
selected three biological replicates from each treatment group (ethanol or control) and 
sequenced (as previously explained) from 39 to 45 clones of the CaMV fragment from each 
replicate. As shown in Fig. 3, silencing of the AtCDAl gene affects the composition of CaMV 
mutant spectrum by reducing the number of G to A transitions in 69.23%. Nonetheless, despite 
this reduction goes in the expected direction, overall, both mutational spectra were not 
significantly different (Fig. 3: % 2 = 9.108, 6 d.f, P = 0.168), prompting caution against making a 
strong conclusion on the role of deamination in the observed increase in CaMV accumulation. 



7 



Downloaded from http://biorxiv.org/ on September 18, 2014 



Therefore, we conclude that suppressing simultaneously the expression of the AtCDAs 1, 
2, 3, 4, 7, and 8 significantly reduces the accumulation of CaMV. However, the 
characterization of the mutant spectrum of the same CaMV populations provides no strong 
enough support to the cytidine deamination hypothesis. 

Discussion 

Lethal mutagenesis through deamination of RNA/DNA by cytidine deaminases has been proven 
to work as an antiviral mechanism against retroviruses 16 " 19 ' 23 " 27 , and some DNA 28 " 30 and RNA 31 
viruses infecting mammals. Our results show that A. thaliana CDA1 gene has some degree of 
mutagenic activity on the pararetrovirus CaMV genome. Moreover, simultaneous suppression 
of the expression of a subset of AtCDAs, including AtCDAl, increased CaMV load, strongly 
suggesting an antiviral role for AtCDAs. 

Our data show that AtCDAs probably restrict CaMV replication through a process similar 
to the restriction of HIV-1 by APOBEC3. CaMV replicates in the cytoplasm by reverse 
transcription using the plus-strand 35 S RNA as template. As for HIV-1, first strand negative 
cDNA could be deaminated during reverse transcription, transforming deoxycytidine into 
deoxyuridine. Then, when the plus strand is produced an adenine is incorporated instead of a 
guanine, increasing the proportion of G to A mutations. In the case of HIV-1, this G to A 
mutational bias is explained by A3G and A3H specificity for single negative stranded DNA 
because during HIV-1 replication. Cytosine to guanine transitions are rare and restricted to the 
PBS site and U3 region in the 5' long terminal repeat where plus stranded DNA is predicted to 
become transiently single stranded 40 . Similarly, during CaMV replication the negative strand 
remains single stranded, while the positive is copied from it and remains double stranded 41 . 
Surprisingly, for AtCDAl C to T mutations were also increased; the region studied here is close 
to the 5' end of CaMV, which contains the PBS for minus strand synthesis and the ssDNA 
discontinuity Al. The observed C to T transitions could reflect transient positive stranded 
ssDNA in the 5' terminal region during reverse transcription, nevertheless a different substrate 
specificity of A. thaliana CDAs cannot be ruled out. 
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Most of the G to A transitions detected in agroinfiltration experiments were located in the 
G at position 181. HIV- 1 hypermutated genomes show mutational hot spots as well, which are 
due to preference of A3G and A3F for deamination of the third cytidine in 5'-CCC (minus 
strand) and 5'-GGC, respectively 42 ' 43 . The context of the cytidine complementary to G181 (5'- 
GGC) differs from those described for APOBEC3, suggesting that if A. thaliana CDAs had 
context preference, it would be different to A3G. However, the low number of mutations found 
does not allow concluding a possible context preference for AtCDAs. Since our experiments 
are performed in vivo, negative selection is expected to purge genomes carrying deleterious 
mutations. This limitation could account for our failure to detect largely hypermutated genomes 
and makes necessary to develop new selection-free assays to further characterize AtCDA- 
induced mutagenesis. 

Although there is not a demonstrated correlation between the expression of APOBEC3 
and mutational bias of viruses infecting mammals, caulimoviruses have an excess of G to A 
transition in synonymous positions 44 . In A. thaliana plants, we found that silencing of AtCDAl 
reduced the frequency of G to A transitions in CaMV genome, suggesting a contribution of 
AtCDAs to the nucleotide bias found in caulimoviruses. The increased viral load in CDA- 
silenced A. thaliana plants strongly suggest that deamination of viral genomes may work as an 
antiviral mechanism in plants, opening questions about how general this mechanism might be, 
and also about what its contribution to viral evolution shall be. Describing a new natural 
antiviral mechanism in plants opens new research avenues for the development of new durable 
control strategies. 

Methods 

Transient overexpression of AtCDAs in N. bigelovii plants infected with CaMV. AtCDAs 
cDNAs were cloned under the 35S promoter in pBIN61 vector 45 . N. bigelovii plants were 
inoculated with CaMV virions purified from Brassica rapa plants 46 previously infected with the 
clone pCaMVW260 47 . Symptomatic leafs were agroinfiltrated 45 with one of the nine AtCDAs 
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and with the empty vector pBin61, each on one half leaf. Samples were collected three days 
post-agroinfiltration. 

Inducible cosuppression of multiple AtCDAs expression by RNAi. The design and cloning 
of the amiR able to simultaneously suppress the expression of AtCDAs 1, 2, 3, 4, 7, and 8 was 
performed as described in ref. 48. The amiRNA was cloned under the control of A. nidulans 
ethanol regulon 49 ' 50 and used to transform A. thaliana by the floral dip method 51 . By doing so, 
we obtained the transgenic line miR 1-6-3. One-month-old seedlings of transgenic and wild- 
type A. thaliana were treated with 2% ethanol (or water for the control groups) three times 
every four days. Three days after the third treatment, plants were inoculated with the infectious 
clone pCaMVW260 as described in ref. 47. Infections were established by applying 1.31x10 
molecules of pCaMVW260 to each of three leaves per plant. Subsequently, plants were 
subjected to two additional treatments with 2% ethanol (or water) one and five days post- 
infection. Finally, samples were taken eight days after inoculation and handled as previously 
described 51 . For each genotype (transgenic or wild-type) and treatment (ethanol or water) 
combination, 22 plants were analyzed. 

Detection of A/T enriched genomes. CaMV genomic DNA was purified using DNeasy Plant 
Mini Kit (Qiagen) according to manufacturer's instructions. For detection of edited genomes 
3D-PCR was performed using primers HCa8Fdeg and HCa8Rdeg (Supplementary Table 2). 
PCRs were performed in a Mastercycler* (Eppendorf) at denaturation temperatures 82.1 °C, 82.9 
°C, 83.9 °C, and 85.0 °C. PCR products obtained with the lowest denaturation temperature were 
cloned in pUC19 vector (Fermentas), transformed in Escherichia coli DH5a and sent to 
GenoScreen (Lille, France) for sequencing. 

RT-qPCR analysis of AtCDAl mRNA and qPCR analysis of CaMV load in transgenic 
plants. Total RNA was extracted from A. thaliana plants using the RNeasy* Plant Mini Kit 
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(Qiagen), according to manufacturer's instructions. AtCDAl specific primers qCDAl-F and 
qCDAl-R were designed using Primer Express software (Applied Biosystems). RT-qPCR 
reactions were performed using the One Step SYBR PrimeScript RT-PCR Kit II (Takara). 
Amplification, data acquisition and analysis were carried out using an Applied Biosystems 
Prism 7500 sequence detection system. All quantifications were performed using the standard 
curve method. To quantify AtCDAl mRNA, a full-ORF runoff transcript was synthetized with 
T7 RNA polymerase (Roche) using as template a PCR product obtained from cloned AtCDAl 
and primers T7-CDA1F and qCDAl-R. CaMV qPCR quantitation was performed as described 
in ref. 5 1 . 

Primers. All primers used are listed in Supplementary Table 2. 
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Figure Legends 

Figure 1 | Number of mutations in CaMV genomes isolated from plant tissues 
agroinfdtrated with different CDAs. (A) AtCDAl, (B) AtCDA2 and (C) AtCDA9. The 
pBIN61 empty vector was agroinfiltrated in the same leaves than their corresponding CDAs 
(mock). For each sample 20034 nucleotides were sequenced. 
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Figure 2 | Accumulation of CDA1 mRNA molecules and CaMV genomes, (a) Number of 
AtCDAl mRNA molecules/80 ng total RNA quantified by RT-qPCR using the standard curve 
method for absolute quantification, (b) Number of CaMV genomes/80 ng total DNA. For each 
block of plants (wild-type and amiRl-6-3), values were normalized to the average number of 
genomes estimated in the corresponding water-treated (control) plants. 
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Figure 3 | Number of mutations found pooling the CaMV sequences from ethanol-treated 
and control amiRl-6-3 plants (3 replicates). The number of nucleotides sequenced was 
23436 for control and 24003 for ethanol-treated plants. Ethanol-treated plants turn on the 
expression of amiR 1-6-3 that was designed to silence the expression of the AtCDAl gene. 
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